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(54) Method of modifying speech of a call center agent such that it possesses a dialect 



(57) A method and apparatus are provided for ad- 
justing a dialect for an oral presentation provided by an 
agent of an organization to a human target of the organ- 
ization through a communications network. The method 
includes the steps of determining a dialect to be used 



by the agent for communicating with the target, modify- 
ing the dialect of the oral presentation of the agent for 
communicating with the target based upon the deter- 
mined dialect and presenting the modified oral presen- 
tation to the target. 
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Description 

Field of the Invention 

[0001] The field of the invention relates to communi- 
cation systems and more particularly to telemarketing 
devices. 

Background of the invention 

[0002] Telephone devices used for telemarketing are 
generally known. Such devices may operate to process 
either incoming or outgoing calls, in the simplest case, 
an organization controlling the telemarketing process 
may provide a bank of telephones and a corresponding 
group of agents to man the bank of telephones. As calls 
come in, an agent may answer the call and respond to 
concerns of the caller. When an agent is idle, the organ- 
ization may provide an outgoing list of customer con- 
tacts, which the agent may call when the agent is oth- 
erwise not occupied. 

[0003] In more sophisticated organizations, an auto- 
matic call distributor (ACD) may be provided to route 
calls to idle agents. The ACD may be connected to the 
public switched telephone network (PSTN) through one 
or more trunk lines. Often a single telephone number is 
disseminated to the public for purposes of contacting the 
organization. The PSTN, upon detecting the dissemi- 
nated telephone number, routes the calls to the ACD 
through the one or more trunk lines. 
[0004] Along with delivery of a call, the PSTN may al- 
so deliver information about an incoming call along with 
the call. For example, a dialed number identification 
service (DNIS) utility provides a number dialed by the 
caller along with the delivered call. Such a utility is im- 
portant where the organization has a number of depart- 
ments within the organization, each charged with han- 
dling a different subject matter with regard to incoming 
calls. 

[0005] Another utility operating from within the PSTN 
is automatic number identification (AN I). AN I delivers a 
telephone number of a caller along with a call. ANI may 
be used by a ACD to identify a customer at the instant 
of call arrival, instead of after the call is answered by an 
agent. 

[0006] By relying upon such information as DNIS and 
ANI, a controller of the ACD may select an agent who 
is most qualified to answer a call based upon a number 
dialed or upon an identity of the caller. Further, using 
ANI, customer records can be retrieved and displayed 
for the benefit of an selected agent at the same instant 
as call arrival. 

[0007] While telemarketing organizations using ACDs 
are relatively efficient, there is a growing tendency to 
maximize resources within such organizations. ACDs 
are often used to receive and place calls over broad ar- 
eas resulting in an increased workload and the interac- 
tion of agent and customers of different cultures and di- 



alects. 

[0008] Because of the increased competition among 
businesses, it is increasingly necessary for a merchant 
to minimize perceived cultural differences between its 
5 agents and customers. Accordingly, a need exists for a 
means of controlling regional source indicators within 
agent speech. 

Summary 

10 

[0009] A method and apparatus are provided for ad- 
justing a dialect for an oral presentation provided by an 
agent of an organization to a human target of the organ- 
ization through a communications network. The method 
is includes the steps of determining a dialect to be used 
by the agent for communicating with the target, modify- 
ing the dialect of the oral presentation of the agent for 
communicating with the target based upon the deter- 
mined dialect and presenting the modified oral presen- 
ce tation to the target. 

Brief Description of the Drawings 

[0010] 

25 

FIG. 1 is a block diagram of a system for modifying 
the dialect of an oral presentation, shown in a con- 
text of use, in accordance with an illustrated embod- 
iment of the invention; 
30 FIG. 2 is a block diagram of processors and memory 
used by the system of FIG. 1 ; and 
FIG. 3 is block diagram of use of the system of FIG. 
1 under an alternate embodiment. 

35 Detailed Description of a Preferred Embodiment 

[0011] FIG. 1 is a block diagram of an apparatus 10 
for modifying a content (i.e., a dialect) of an oral pres- 
entation of an agent (e.g., a telemarketer), generally in 

40 accordance with an illustrated embodiment of the inven- 
tion. In its simplest form, modification of the agent's oral 
presentation could be described as a filter which is used 
to control and alter a regional source content of an 
agent's voice. The filter may be triggered by any of a 

45 number of factors. For example, source information (e. 
g., ANI or DNIS information from a PSTN) may be used 
as regional indicators. Alternatively, voice recognition 
may be used to analyze and detect a dialect. 
[0012] The system 10 may be connected to a co mm u- 

50 nication system (COMM) 16 through an appropriate 
connection (e.g., a single telephone line or one or more 
trunk lines) 28. The COMM 1 6, in turn, may be connect- 
ed to one or more human targets (e.g., customers, cli- 
ents, supporters, etc.) 12, 14 of the organization. The 

55 target telephone and user of the target telephone will 
sometimes hereinafter be referred to together simply as 
the "TGT 12, 14". 

[0013] Included within the system 10 is a telephone 
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26 that may be used by an agent of the organization (the 
agent and his telephone will sometimes hereinafter also 
be referred to together simply as the "agent 26"). While 
the system 10 is shown as including a single agent 26, 
it should be understood that the system 1 0 may include 
many agents 26. 

[0014] It should also be understood that the system 
1 0 may be implemented in many different forms. For ex- 
ample, the system 1 0 may include a transaction proc- 
essor 18 (e.g., an ACD) supporting a number of con- 
nected agents 26. Further, the COMM 16 and ACD 18 
may be implemented as part of a larger system, such 
as is shown in FIG. 3. To reflect the clearly commercial 
nature of the illustrated embodiment of FIG. 3, the TGT 
12, 14 of FIG. 1 are referred to as customers (CUSTs) 
100-105. Further, the agents 26 may now communicate 
with customers 1 00-1 05 through respective agent sta- 
tions 1 24, 1 26. An agent 26 at an agent station 1 24, 1 26 
may communicate with a customer 100-105 through a 
telephone console 116, 118 or through a terminal 120, 
122 using voice-over- IP (VOIP). 

[0015] The system of FIG. 3 may be used to route 
calls from any of a number of sources (e.g., through the 
PSTN, Internet, cellular telephone system, cable TV 
system, radio, satellite, etc.). As used herein, a call may 
be based on a switched circuit connection (i.e., through 
the PSTN) or a packet connection (e.g., through the In- 
ternet). A switched circuit connection (also sometimes 
referred to simply as a "telephone connection" in the te- 
lephony arts) refers to a dedicated channel existing be- 
tween two parties. As used herein, a packet connection 
does not represent a physical connection , but simply the 
possession and concurrent use by two users of the other 
user's I P address for the exchange of voice information . 
[0016] Under the illustrated embodiment, calls may 
be received from customers 100-105 and be routed to 
a selected transaction processing entity (e.g., agent sta- 
tions 1 24, 1 26 or interactive voice response units (I VRs) 
128, 130) by the transaction processing system 18. 
[001 7] Switched circuit calls with customers 1 00-1 05 
placed through the ACD 134, may be routed by matrix 
switch 1 36 of the ACD 1 34 to a console 1 1 6, 1 1 8 of the 
agent 26 or to an appropriate IVR 128, 130. Calls 
through the Internet 106 may occur as voice over IP. 
When received, the host 1 32 may route the VOIP call to 
a terminal 120, 122 of an agent 26. 
[0018] A customer 100-105 may use a conventional 
telephone or cell phone and/or a computer to place/re- 
ceive a call with the transaction processing system. Al- 
ternatively, the customer 100-105 may place/receive a 
call using an interactive channel of a community anten- 
na television (CATV) system 112, land mobile radio 1 40 
or a transmission channel 142, 144 of a satellite 116. 
Where the customer 100-105 places a call using an in- 
teractive channel of a community antenna television 
(CATV) system 112, land mobile radio 140 or a trans- 
mission channel 142, 144 of a satellite 116, it may be 
assumed that such call is initiated by the entry of a target 



identifier (e.g., a telephone number of the ACD 1 34) us- 
ing a conventional touch-tone pad. 
[001 9] While the term "caller" may herein sometimes 
by used to refer to the customer 100-105 it should be 

5 understood that calls may just as well be initiated by the 
transaction processing system 1 34. For example, cus- 
tomer lists may be maintained in a database of the host 
1 32. Outgoing calls from the system 1 0 may be initiated 
to the customers 100-105 by the CPU 138 through the 

w PSTN 1 08, radio frequency (rf) transceiver 1 1 4 or by the 
host 132 through the Internet 106. 
[0020] Associated with each customer 1 00-1 05 is an 
identifier and call routing information. The identifier may 
be an identifier used for identifying the customer 

15 100-105 within a particular communication system (e. 
g., a telephone number within the PSTN 1 08, an IP ad- 
dress within the Internet 106, a customer account 
number within the CATV system 1 1 2, an electronic serial 
number (ESN) within the land mobile radio 140 or sat- 

20 ellite system 116, etc.). 

[0021] The routing information maybe used to identify 
the particular system (e.g., PSTN 108, Internet 106, 
CATV 112, land mobile radio 140, satellite 116, etc.) 
within which the identifier is to be used. In the case of 

25 the transaction processing system 134, the routing in- 
formation may simply identify the port through which the 
call is to be processed. For example, a port for an Inter- 
net call may be an Internet connection with the host 132. 
A telephone call may be processed through a first set of 

30 trunk connections 28 using a respective port of a switch 
1 36 of the ACD 1 34. A call with a cable subscriber 1 03 , 
land mobile user 104 or satellite customer 105 may be 
processed through a second set of trunk connections 
1 46 using a respective port of the switch 1 36 of the ACD 

35 134. The identifier and call routing information may, to- 
gether, be referred to herein as call associated informa- 
tion. 

[0022] Whether a call is incoming or outgoing, the dis- 
tribution of the call may be substantially the same. When 

40 the call is outgoing, the transaction processing system 
1 8 inherently knows the identity of the call target. When 
the call is incoming, the transaction processing system 
18 may determine the identity of the caller based upon 
other call associated information (e.g., a port number 

45 and AN I or IP address information in the case of the 
PSTN or Internet). By knowing the identity of a customer 
100-105, the transaction processing system 18 may 
route the call based upon the historical needs of the cus- 
tomer 100-105 or some other well-known criteria. 

so [0023] Call delivery to a transaction processing entity 
26 may be accomplished under several formats. Where 
the call is a switched circuit call, the CPU 138 selects 
an agent and delivers the call to the console 1 1 6, 1 1 8 of 
the selected agent or to the selected IVR 128, 130. The 

55 CPU 138 may send a call delivery message including 
the call associated information to the host 1 32. Where 
the host 132 is able to identify customer records, the 
host 132 may present those records to the selected 
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agent 26 at the instant of call delivery (e.g., as a screen 
pop on a terminal 120, 122 of the selected agent). 
[0024] Where a call is through the Internet, presenta- 
tion may be entirely through the terminal 1 20, 1 22 of the 
selected agent. Where the call is in the form of a web 
telephony call, the host 132 may forward the IP address 
of the requestor to the selected agent. Where the host 
132 is able to identify customer records, those records 
may be presented to the agent along with the web te- 
lephony request. 

[0025] As an alternative to the system of FIG. 3, the 
transaction processor 1 8 may be implemented (as sug- 
gested by FIG. 1) on a personal computer (PC) as a call 
processor 18 with a single connected agent 26. Where 
the call processor 18, is implemented as an ACD with a 
number of connected agents 26, it should be understood 
that each agent may serve a different organization. Fur- 
ther, even where a single agent 26 is involved, the or- 
ganization objective may differ depending upon the call 
target 12, 14. 

[0026] The system 10 may be used to serve any of a 
number of organizational objectives. As used herein, an 
organizational objective means any functional activity 
directed to achieving the goals of the organization. Fur- 
ther, it is to be understood that functional activities may 
be achieved through contacts between agents of the or- 
ganization and targets of the organization. Typically the 
outside contact includes an oral presentation by an 
agent 26 of the organization or an oral exchange be- 
tween the agent 26 and a human organizational TGT 
12, 14 regarding a product of the organization, without 
regard to who initiated the contact. 
[0027] For example, if the organization using the sys- 
tem 1 0 is a merchant, then the organizational objective 
would be sales and the contact would be between a 
salesman or solicitor 26 and a member of the general 
public (i.e., TGT 12, 14). Alternatively, if the organization 
is a charitable organization, then the organizational ob- 
jective may be collecting donations and the contact 
would, again, be through a solicitor. 
[0028] If the organization were a political party, then 
the organization objective would be winning an election 
or influencing a vote. If the organization were an envi- 
ronmentalist group, then the organizational objective 
may simply be public awareness. 
[0029] Organizational objectives may be further ad- 
vanced by adjusting a content to eliminate perceived 
cultural differences from contacts between the agents 
26 and TGTs 12, 14. It should be understood in this re- 
gard that the adjustment of a content of an oral presen- 
tation is limited only to the speech of the agent 26 of the 
organization and not to the organizational target. It 
should also be understood that, as used herein, "pres- 
entation" is intended to include an agents participation 
in a two-way conversation and is specifically not limited 
to (but could include) a one-sided sales pitch by the 
agent of the organization's message to the TGT 12, 1 4. 
[0030] Under the illustrated embodiment, as calls are 



detected, the call processor 1 8 may select different di- 
alects based upon call-associated information. Where 
the call is an incoming telephone call, the call -associat- 
ed information may be DNIS or ANI information. Where 

5 the call is an outgoing telephone call, the call-associated 
information may be the telephone number called. 
[0031] Where the call-associated information is 
based upon DNIS information, the processor 22 may se- 
lect a dialect based upon the DNIS information from the 

10 PSTN 16. For example, a particular organization may 
disseminate different telephone numbers in different 
marketing locals. A dialect may be selected based upon 
the marketing local associated with of the call. 
[0032] Similarly, where the call-associated informa- 

15 tion is ANI or an outgoing dialed number, the dialect may 
be based upon the regional identifiers (e.g., the area 
code) of the number or upon experience with the TGT 
12, 14. Using the call-associated information, an initial 
determination of dialect may be made. Following the in- 

20 itial determination of dialect, a dialect file 38, 40 (FIG. 
2) defining the dialect for the call may be retrieved based 
upon the call-associated information. The file 38, 40 
may include a set of dialect parametric values to be used 
for the call. The parametric values may define the dialect 

25 and may be predetermined for each type of call associ- 
ated information and may be stored within memory 24. 
[0033] The call content parametric values defining a 
dialect may be delineated by any of a number of speech 
attributes. Word rate may be one attribute used to define 

30 a dialect. Word choice and inflection may be other at- 
tributes. Software that may be used to define the para- 
metric values of dialects (or recreate a dialect using the 
defined parameters) may be obtained from any of a 
number of sources (e.g., http://faculty.washington.edu/ 

35 dillon/PhonResources/ PhonResources.html, http:// 
www.linguasphere.org/, etc.) 

[0034] Once the parametric values defining the dia- 
lect has been retrieved, the call processor 1 8 may con- 
nect the call to the agent 26 and begin detecting a word 

40 content of the call. To being detecting a content of the 
call, a digital signal processor (DSP) 20 (functioning as 
a sampling processor) may begin sampling a voice sig- 
nal from the agent and a voice signal from the TGT 12, 
1 4. The sampled signals from the agent 26 and TGT 12, 

45 14 may then each be converted into a voice profile within 
a Fourier processor 32 using a fast Fourier transform 
(FFT). 

[0035] Using the voice profile of the TGT 12, 14, a 
word recognition (e.g., a speech recognition) processor 

50 42 may be used to determine a word content and a di- 
alect being used by the TGT 12,14. Where the detected 
dialect of the TGT 12, 14 differs from the dialect of the 
initial determination, a new dialect file 38, 40 may be 
retrieved. The detected word content and dialect may 

55 be stored in memory 24 for later use as discussed below. 
[0036] Next, the voice profile of the agent 26 may also 
be applied to the word recognition processor 42 to de- 
tect a content of the oral presentation of the agent. The 
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content may include a word content, a word rate and a 
dominant frequency. 

[0037] Under one illustrated embodiment, the dialect 
of the agent 26 and TGT 12, 14 may each be deter- 
mined. The determination of the original dialect of the 
agent may be used to improve system performance by 
detecting and imparting additional meaning to the 
agents word choice. Alternatively, only the dialect of the 
TGT 12, 14 need be determined. Once determined, 
each word of the agent 26 may be adjusted to conform 
to the determined dialect of the TGT 12, 14. 
[0038] Where the dialect of both the agent 26 and 
TGT 12, 14 are determined, the voice profiles of the 
agent 26 may be compared with the contents of a se- 
lected one of the dialect files 38, 40 in a content proc- 
essor 44 to determine whether any modification or ad- 
justments are in order. Modification of the content of the 
agent's presentation may be performed on a word-by- 
word basis or upon an entire word stream from the agent 
based upon the content of the selected dialect file 38, 
40. 

[0039] A dialect of the agent 26 may be matched to 
the TGT 12, 14 using templates 34, 36 and/or software 
(as discussed above) designed for generating a partic- 
ular dialect. Where the dialect file 38, 40 requires or 
where the recognized dialect of theTGT 12, 1 4 suggests 
a particular dialect, the processor 22 may modify the 
agent's presentation to match that dialect. Such match- 
ing may be accomplished by first recognizing each word 
from the agent 26 and then selecting a template for the 
corresponding word in the particular dialect required. 
The template may then be used as a means for substi- 
tuting a synthesized word of the proper dialect for the 
recognized word spoken by the agent 26. 
[0040] The templates may be implemented under any 
of a number of different methods. For example, a word 
template may be based upon recordings of words spo- 
ken under different dialects. As above, the words of the 
different dialects may be sampled and converted using 
a FFT into word profiles. A dominant frequency of the 
profiles may be shifted up or down to match the agents 
normal tone of voice and converted back into an audio 
presentation using an inverse FFT. 
[0041] Regionally specific words may be added or 
substituted where required by the dialect file 38, 40. For 
example, some words (e.g., license plate and tag) have 
different meanings in different locals. Regionally specif- 
ic words may also be used as indicia of dialect when 
detected in the speech of a TGT 12, 14 and agent 26. 
Regionally specific words may also be inserted into the 
speech of the agent 26 to achieve the intended dialect. 
[0042] Under the illustrated embodiment, the initial 
determination of dialect may be changed based upon 
the voice profiles from the TGT 12, 14 during the ex- 
change between the agent 26 and TGT 12, 14. Where 
the determined dialect is later determined to be different 
than the initially determined dialect, then a different di- 
alect file 38, 40 may be retrieved and substituted as nec- 



essary for adjusting the dialect of the agent 26. 
[0043] A specific embodiment of a method and appa- 
ratus of adjusting a dialect of an oral presentation ac- 
cording to the present invention has been described for 
s the purpose of illustrating the manner in which the in- 
vention is made and used. It should be understood that 
the implementation of other variations and modifications 
of the invention and its various aspects will be apparent 
to one skilled in the art, and that the invention is not lim- 
10 ited by the specific embodiments described. Therefore, 
it is contemplated to cover the present invention any and 
all modifications, variations, or equivalents that fall with- 
in the true spirit and scope of the basic underlying prin- 
ciples disclosed and claimed herein. 



1 . A method of adjusting a dialect for an oral presen- 
ce tation provided by an agent of an organization to a 

human target of the organization through a commu- 
nications network, such method comprising the 
steps of: 

25 determining a dialect to be used by the agent 

for communicating with the target; 
modifying the dialect of the oral presentation of 
the agent for communicating with the target 
based upon the determined dialect; and 
30 presenting the oral presentation in the modified 

dialect to the target. 

2. The method of selecting a dialect as in claim 2 
wherein the step of determining the dialect further 

35 comprises retrieving call associated information re- 
lating to the target. 

3. The method of selecting a dialect as in claim 1 
wherein the step of modifying the content of the oral 

40 presentation further comprises recognizing a spo- 
ken word content. 

4. The method of selecting a dialect as in claim 3 
wherein the step of detecting a spoken word content 

45 of the oral presentation further comprises compar- 
ing the recognized spoken word content with a set 
of verbal templates. 

5. The method of selecting a dialect as in claim 4 
50 wherein the step of comparing the recognized spo- 
ken word content with a set of verbal templates fur- 
ther comprises selecting a template of the set of 
templates with a nearest match. 

55 6. The method of selecting a dialect as in claim 4 
wherein the step of comparing the recognized spo- 
ken word content with a set of verbal templates fur- 
ther comprises substituting an equivalent word with 
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the determined dialect. 

7. An apparatus for selecting a dialect for an oral pres- 
entation provided by an agent of an organization to 15. 
a human target of the organization through a com- 5 
munications system, such apparatus comprising: 

means for determining a dialect to be used by 
the agent for communicating with the target; 16. 
means for modifying a dialect of the oral pres- 10 
entation of the agent based upon the deter- 
mined dialect; and 

means for presenting the modified oral presen- 1 7. 

tation to the target. 

15 

8. The apparatus for selecting a dialect as in claim 7 
wherein the means for determining the dialect fur- 
ther comprises means for retrieving call associated 
information relating to the target. 

20 

9. The apparatus for selecting a dialect as in claim 7 
wherein the means for modifying the dialect of the 
oral presentation further comprises means for rec- 
ognizing a spoken word content. 

25 

10. The apparatus for selecting a dialect as in claim 9 
wherein the means for detecting a spoken word 
content of the oral presentation further comprises 
means for comparing the recognized spoken word 
content with a set of verbal templates. 30 

1 1 . The apparatus for selecting a dialect as in claim 1 0 
wherein the means for comparing the recognized 
spoken word content with a set of verbal templates 
further comprises means for selecting a template of 35 
the set of templates with a nearest match. 

12. The apparatus for selecting a dialect as in claim 9 
wherein the means for recognizing the spoken word 
content further comprises means for substituting an , 40 
equivalent word with the determined dialect. 

1 3. An apparatus for selecting a dialect for an oral pres- 
entation provided by an agent of an organization to 

a human target of the organization through a com- *5 
munications system, such apparatus comprising: 

a call analyzer adapted to determine a dialect 
to be used by the agent for communicating with 
the target; so 
a voice processor adapted to modify a dialect 
of the oral presentation from the agent based 
upon the determined dialect; and 
a voice synthesizer adapted to present the 
modified oral presentation to the target. 55 

14. The apparatus for selecting a dialect as in claim 13 
wherein the call analyzer further comprises a tele- 



phone system interface adapted to receive call as- 
sociated information relating to the customer. 

The apparatus for selecting a dialect as in claim 14 
wherein the voice processor further comprises a 
word recognition module adapted to recognize a 
spoken word content of the oral presentation. 

The apparatus for selecting a dialect as in claim 15 
wherein the voice recognizer further comprises a 
plurality of verbal templates. 

The apparatus for selecting a dialect as in claim 15 
wherein the voice recognizer further comprises a 
comparison processor adapted to compare the rec- 
ognized spoken word content with the plurality of 
verbal templates. 
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